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Abstract 


A search for Higgs boson production in association with aW or Z boson, in the WW* decay 
channel, is performed with a data sample collected with the ATLAS detector at the LHC in proton- 
proton collisions at centre-of-mass energies ^/s = 7 TeV and 8 TeV, corresponding to integrated 
luminosities of 4.5 fb~^ and 20.3 fb“L respectively. The WH production mode is studied in two-lepton 
and three-lepton final states, while two-lepton and four-lepton final states are used to search for the 
ZH production mode. The observed significance, for the combined WH and ZH production, is 
2.5 standard deviations while a significance of 0.9 standard deviations is expected in the Standard 
Model Higgs boson hypothesis. The ratio of the combined WH and ZH signal yield to the Standard 
Model expectation, ^vh, is found to be ^^VH = 3.0l};5 (stat.)!^;^ (sys.) for the Higgs boson mass of 
125.36 GeV. The WH and ZH production modes are also combined with the gluon fusion and vector 
boson fusion production modes studied in the H WW* decay channel, resulting in an over¬ 

all observed significance of 6.5 standard deviations and /^ggF+VBF+VH = l-16loj5(stat.)+o;}5(sys.). 
The results are interpreted in terms of scaling factors of the Higgs boson couplings to vector bosons 
{kv) and fermions {up)', the combined results are: \kv\ = 1-061o;Jq, 
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1 Introduction 


In the Standard Model (SM) of fundamental interactions, the Brout-Englert-Higgs [1-3] 
mechanism induces the electroweak symmetry breaking that provides mass to elementary 
particles. The mechanism postulates the existence of an elementary scalar particle, the 
Higgs boson. The ATLAS and CMS collaborations at the CERN Large Hadron Collider 
(LHC) have observed the Higgs boson with a mass {mu) of about 125 GeV [4, 5]. The 
measurements of the Higgs boson couplings to SM particles, and its spin and CP quantum 
numbers, are essential tests of the SM [6-12]. Higgs boson production in association with 
a,W or Z (weak) boson, which are respectively denoted hy WH and ZH, and collectively 
referred to as VH associated production in the following, provides direct access to the 
Higgs boson couplings to weak bosons. In particular, in the WH mode with subsequent 
if—> WW* decay, the Higgs boson couples only to W bosons, at both the production and 
decay vertices. 

Searches for VH production have been performed at both the Tevatron and LHC 
colliders, in events with leptons, 6-jets and either missing transverse momentum or two 
central jets. Evidence for VH production has been recently reported in the Tevatron 
combination [13] while no VH production has been observed so far at the LHC [14-20]. 

In this paper, a search for Higgs boson production in association with a weak boson, 
followed by H^WW* decay, is presented. The data were collected in 2011 and 2012 by the 
ATLAS experiment at centre-of-mass energies of ^/s = 7 TeV and 8 TeV, respectively. In 
the SM, for mn = 125 GeV, the cross sections of the WH and ZH associated production 
modes, followed by the H^WW* decay, are 0.12 pb and 0.07 pb at ^/s = 7 TeV and 
0.15 pb and 0.09 pb at ^/s = 8 TeV [21], respectively. Four topologies are considered, with 
two, three or four charged leptons in the final state (only electrons or muons are considered). 
The analyses are optimised to search for both the WH and ZH production modes; a 
combined result for VH is also presented. The VH results are then further combined 
with the 77—)- WW* —>• ii/iu analysis of gluon fusion (ggF) and vector boson fusion (VBF) 
production, for which the ATLAS Collaboration has reported the observation of the Higgs 
boson in the H^WW* decay channel with a signihcance of 6.1 standard deviations [22]. 

The combination of the ggF, VBF and VH analyses, presented in this paper, is used 
to determine the couplings of the Higgs boson to vector bosons and, indirectly, to fermions, 
providing further constraints on the Higgs boson couplings. 

2 Analysis overview 

Higgs boson production in association with a IV or Z boson, followed by 77—)■ IVIV* decay, 
is sought using events with two, three or four charged leptons in the final state. Leptonic 
decays of r leptons from 77—)• IVW* —)• rz/ri/ are considered as signal, while no specific 
selection is performed for events with hadronically decaying r leptons in the final state. In 
the present analysis events from V77(77 —)> rr) are considered as background. The analysis 
is designed to select events which are kinematically consistent with the V77(77—)• WIV*) 
process, in order to enhance the signal-to-background ratio. Figure 1 illustrates the relevant 
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Figure 1. Tree-level Feynman diagrams of the VH{H^ WW*) topologies studied in this analysis: 
(a) 4£ channel (b) 3£ channel (c) opposite-sign 2£ channel and (d) same-sign 2£ channel. For charged 
lepton external lines, the directions of arrows refer to the superscripted sign. Relevant arrows are 
assigned to the associated neutrino external lines. 


tree-level Feynman diagrams of the studied processes, in which a Higgs boson is produced 
in association with a weak boson. 

Four channels are analysed, defined as follows: 

(a) 4£ channel (figure 1(a)): The leading contribution consists of a process in which a 
virtual Z boson radiates a Higgs boson, which in turn decays to a VF boson pair. 
The decays of the weak bosons produce four charged leptons and two neutrinos in 
the final state. The lepton pair with an invariant mass closest to the Z boson mass 
is labelled as (£ 2 , £ 3 ), while the remaining leptons are labelled as £0 and £1 and are 
assumed to originate in the )• WW* decay. The main backgrounds to this channel 
are non-resonant ZZ* and ZWW* production. 

(b) 3£ channel (figure 1(b)): The leading contribution consists of a process in which a 
virtual W boson radiates a Higgs boson, and the Higgs boson decays to a VF boson 
pair. All the weak bosons decay leptonically producing three charged leptons and 
three neutrinos in the final state. The lepton with unique charge is labelled as £q^ 
the lepton closest to £3 in angle is labelled as £ 1 , and the remaining lepton is labelled 
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as i 2 - Leptons and ii are assumed to originate from the H^WW* decay. The 
most prominent background to this channel is WZjW^* production; non-resonant 
WWW* production is also a significant background having the same final state as the 
signal. Other important backgrounds are ZZ* ^ Z 7 , Z+jets, tt and Wt production, 
as they pass the signal selection if a lepton is undetected or because of a misidentified 
or non-prompt lepton from a jet. 

(c) Opposite-sign 2i channel (figure 1(c)): The leading contribution consists of a pro¬ 
cess in which the weak boson V, which radiates the Higgs boson, decays hadronically 
and produces two energetic jets, while W bosons from the LT—)• WW* decay produce 
two oppositely charged leptons, labelled as io and £ 1 , and two neutrinos. The WH 
process is expected to account for 70% of the signal yield, while the ZH process 
accounts for the remaining 30%. After requiring two leptons of different flavour, the 
leading backgrounds for this channel are tt and Wt processes. Other major com¬ 
ponents are Z ^ tt and WW production with two associated jets. Final states 
including VF-|-jets and multijets may produce misidentified leptons, contaminating 
the signal region. Other background sources include WZjW'y* production and other 
Higgs boson production and decay modes, especially ggF production. 

(d) Same-sign 2£ channel (figure 1(d)): The leading contribution consists of a process 
in which a W boson radiates the Higgs boson, and then decays leptonically. The 
radiated Higgs boson decays to two W bosons, one decaying hadronically and the 
other, with the same charge as the first lepton, decaying leptonically. The final state 
therefore contains two leptons with same charge, labelled as l\ and £ 2 , two neutrinos 
and two energetic jets. Significant backgrounds in this channel are WZjW^* ^ IT 7 
and IT-l-jets production; WW, Z-|-jets and top-quark processes also contribute to this 
final state. Due to the overwhelming background from tt production, the selection 
is not optimised for events in which the lepton from the Higgs boson decay and the 
lepton from the associated W boson have opposite charges. 

All the channels described above are mutually exclusive due to the respective number of 
leptons with transverse momentum, pTj greater than 15 GeV. To maximise the analysis 
sensitivity to the VH[H^ WW*) process in each of these decay modes, the data samples 
for each topology, except for the opposite-sign 2 £ channel, are further subdivided into sev¬ 
eral signal regions (SRs). Additional kinematic regions, with orthogonal selection criteria, 
designated as control regions (CRs), are used to normalise the major backgrounds in each 
SR by extracting normalisation factors. 

The final results are extracted from a fit that simultaneously considers all SRs and 
CRs. The 41 channel is split into two samples according to the number of same-flavour 
opposite-sign (SFOS) lepton pairs, namely 4G2SFOS and 4£-lSFOS. The sample containing 
two SFOS pairs suffers from a higher background contamination than the sample with one 
SFOS pair. The 3i analysis requires at least one opposite-charge lepton pair, therefore 
the 3i system must have total charge of ±1. This analysis separates events with three 
same-flavour (SF) leptons, one SFOS lepton pair and zero SFOS lepton pairs, which have 
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different signal-to-background ratios. For the 3£-3SF and 3£-lSFOS channels a multivariate 
analysis is performed. The same-sign 2^ sample is divided into two sub-channels with one 
or two selected jets in the final state, namely 2t'-SSljet and 2£-SS2jet. The channel with 
two leptons of different flavour and opposite sign is denoted by 2t'-DFOS in the following 
sections. 

3 The ATLAS detector 

ATLAS [23] is a multi-purpose particle physics detector with a forward-backward symmet¬ 
ric cylindrical geometry^ and close to dvr coverage in solid angle. It consists of an inner 
tracking detector (ID) surrounded by a thin 2 T superconducting solenoid, electromag¬ 
netic and hadronic calorimeters, and a muon spectrometer (MS) incorporating three large 
superconducting toroid magnets, each with eight coils. 

The ID covers the pseudorapidity range \rj\ < 2.5 and consists of multiple layers of 
silicon pixel and microstrip detectors, and a straw-tube transition radiation tracker. The 
calorimeter system covers the pseudorapidity range |r/| < 4.9. Within the region |r/| < 3.2, 
electromagnetic calorimetry is provided by barrel and endcap high-granularity lead/liquid- 
argon (LAr) calorimeters. An additional thin LAr presampler covering |r/| < 1.8 is used 
to correct for energy loss in the material upstream of the calorimeters. Hadronic calorime¬ 
try is provided by a steel/scintillator-tile calorimeter, covering |r/| < 1.7, and two cop- 
per/LAr hadronic endcap calorimeters. The solid angle coverage is completed with for¬ 
ward copper/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic 
and hadronic measurements, respectively. The MS consists of separate trigger and high- 
precision tracking chambers that measure the deflection of muons in the magnetic field 
generated by superconducting air-core toroids. The precision chamber system covers the 
region \r]\ < 2.7 with three stations of monitored drift-tube layers, except for the forward 
region where the innermost station is equipped with cathode strip chambers. The muon 
trigger system covers the range |7y| < 2.4 with resistive plate chambers in the barrel and 
thin gap chambers in the endcap regions. A three-level trigger system is used. The first- 
level trigger is hardware-based, using a subset of the detector information, and reduces the 
event rate to less than 75 kHz. This is followed by two software-based trigger levels, which 
together reduce the event rate to about 400 Hz. 

4 Data samples 

The data were recorded using inclusive single-lepton and dilepton triggers. Overall quality 
criteria were applied in order to suppress non-collision backgrounds such as cosmic-ray 
muons, beam-related backgrounds, or noise in the calorimeters. The datasets used in the 

^ ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in 
the centre of the detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre 
of the LHC ring, and the y axis points upward. Cylindrical coordinates (r, (f>) are used in the transverse 
plane, (j) being the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the 
polar angle 0 as rj = — lntan(0/2). 
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8 TeV and 7 TeV analyses correspond to an integrated luminosity of 20.3 fb“^ and 4.5 fb~^ 
respectively. The analysis of the 2.£-SS channel was performed only on the 8 TeV data 
sample, due to the low sensitivity of this channel. The 8 TeV data were taken at a higher 
instantaneous luminosity (£ ~ 7x 10^^ cm“^s“^) than that for the 7 TeV data (£ ~ 3 x 10^^ 
cm“^s“^) and with a higher number (~ 21 versus ~ 9) of overlapping proton-proton 
collisions, producing higher out-of-time and in-time pile-up [24]. The increased pile-up 
rate, rather than the increased centre-of-mass energy, is the main reason for the differences 
between 8 TeV and 7 TeV analysis selections. 

Table 1 lists the Monte Carlo (MC) generators used to model the signal and background 
processes. For the Higgs production processes the production cross section multiplied by 
the branching fraction of the i7—?■ WW* decay is shown, while for the background processes 
the production cross section, including effects of cuts applied at the event generation, is 
presented. The samples were simulated and normalised for a Higgs boson of mass mjj = 
125 GeV. The Vff samples were simulated with Pythia and normalised to the next- 
to-next-to-leading-order (NNLO) QCD calculations [21, 44-47] with additional next-to- 
leading-order (NLO) electroweak (EW) corrections computed with Hawk [48] and applied 
as a function of the transverse momentum of the associated vector boson. The gg —)■ ZH 
samples were simulated with Powheg-BoxI.O interfaced with pythia8 and normalised 
to the NNLO QCD calculations [45]. Associated Higgs boson production with a ti pair 
(ttH) is simulated with Pythia8 and normalised to the NLO QCD estimation [21, 44, 45]. 

The matrix-element-level calculations are interfaced to generators that model the par- 
ton shower, the hadronisation and the underlying event, using either PythiaG, Pythia8, 
Herwig with the underlying event modelled by Jimmy [49], or Sherpa. The CTIO parton 
distribution function (PDF) set [50] is used for the Powheg-Box and Sherpa samples 
while the CTEQ6L1 PDF set [51] is used for Alpgen and AcerMC samples. The Zj^* 
sample is reweighted to the MRSTMCal [52] PDF set. The simulated samples are described 
in detail in ref. [ 22 ] with a few exceptions that are reported in the following. 

The Zj'^* processes associated with light- and heavy-flavour (HF) jets are modelled by 
AlpgenJ-Herwig with merged leading-order (LO) calculations. The simulation includes 
processes with up to five additional partons in the matrix element, or three additional 
partons in processes with h- or c-quarks. An overlap-removal procedure is applied to avoid 
double counting of HF in the light-jet samples. The sum of the two samples is normalised to 
the NNLO calculation of Dynnlo [53, 54]. The ttWjZ and tZ backgrounds are simulated 
using Madgraph at LO interfaced with PythiaG. The production of four leptons from 
a pair of virtual Z or 7 bosons, indicated by ZZ* in the following, contributes to the 
background in the 2>i channel when one low-px lepton is not detected. Since this background 
is more prominent when one lepton pair has a very low mass, a dedicated sample which 
requires at least one SFOS pair with mu < 4 GeV, generated with Sherpa and normalised 
to the NLO QCD cross section from the parton-level MC program MCFM [55], is included. 
Production of triboson processes is a major source of background, in particular WWW* in 
the 3£ channel and ZWW* in the M channel. They are modelled by Madgraph interfaced 
with PythiaG and normalised to the NLO cross section from ref. [56]. 
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All samples are processed using the full ATLAS detector simulation [57] based on 
Geant4 [58], except for WH, WZjWj* with mu > 7 GeV, qq/qg WW, WWj*, ti 
and single top, which are instead simulated with Atlfast-II [59], a parameterisation of 
the response of the electromagnetic and hadronic calorimeters, and with Geant4 for other 
detector components. The events are reweighted to ensure that the distribution of pile-up 
observed in the data is correctly reproduced. 

5 Event reconstruction and selection 
5.1 Event reconstruction 

The primary vertex of each event is selected as the vertex with the largest value of 
where the sum is over all the tracks associated with that particular vertex. Furthermore, 
it is required to have at least three tracks with px > 400 MeV. 

Muons are reconstructed in the region jr/j < 2.5 by combining tracks reconstructed 
in the MS and ID [60]. This analysis uses muon candidates referred to as “Chain 1, CB 
muons” in ref. [60]. Electrons are identified within the region \r]\ < 2.47, except in the 
transition region between barrel and endcap calorimeters (1.37 < jr/j < 1.52), through the 
association of an ID track to a calorimeter cluster whose shower profile is consistent with 
an electromagnetic shower [61]. Electron identification uses information from both the 
calorimetric and tracking system. In the 7 TeV analysis a cut-based approach is adopted 
while in the 8 TeV analysis a likelihood-based selection is also exploited as described in [62] . 
Eollowing that reference, in the 41 and 3i channels, electrons with px < 20 GeV are required 
to satisfy the “very tight” likelihood requirement, while electrons with px > 20 GeV are 
required to satisfy the “loose” likelihood requirement. In the 2i channels, electrons with px 
< 25 GeV are required to satisfy the “very tight” likelihood requirement, while electrons 
with Px > 25 GeV are required to satisfy the “medium” likelihood requirement. 

Both a track-based and a calorimeter-based isolation selection are applied to leptons. 
The isolation criteria are chosen to maximise the sensitivity to the VH[H^ WW*) process 
at mu = 125 GeV. The track-based isolation is built on the computation of the scalar sum 
of the Px of tracks associated with the primary vertex and inside a cone, constructed around 
the candidate lepton, of size AT? = 0.2 ^ and excluding the track of the candidate lepton. 
The calorimeter-based isolation uses the scalar sum of the transverse energies measured 
within a cone of Aii = 0.2, excluding the energy of the calorimeter cluster associated with 
the lepton itself. For the 8 TeV data the electron calorimeter-based isolation algorithm 
uses topological clusters [62], while for the 7 TeV data it uses calorimeter cells. Cell-based 
isolation is used for muons in the calorimeter in both the 8 TeV and 7 TeV analyses. The 
calorimeter and track isolation criteria differ between the 8 TeV and 7 TeV data samples 
and are not the same for all the channels. The upper bound of the calorimeter-based 
isolation energy varies from 7% to 30% of the lepton px, while the sum of the px of the 
tracks in the cone is required to be smaller than 4% to 12% of the lepton px, where tighter 
cuts are applied at low px- Less stringent isolation criteria on energy and px are required 

"Ait = y(Ar/)2 -h 
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for the 7 TeV data sample, due to the lower level of pile-up compared to the 8 TeV data 
sample. 

Jets are reconstructed from three-dimensional topological clusters [63] over the region 
\r]\ < 4.5 using the anti-fc* algorithm [64] with radius parameter R = 0.4. Jets are required 
to have px larger than 25 GeV except for the forward region, \r]\ > 2.4, in which the 
threshold is raised to 30 GeV. In order to suppress the contamination of jets from pile-up, 
the following selection is applied: the sum of the px of all tracks within AR = 0.4 of the jet 
axis and that of the subset of these associated with the primary vertex is computed. The 
ratio of the latter to the former is required to be larger than 0.5 (0.75) for the 8 (7) TeV 
data samples, for all jets with px < 50 GeV and jr/j < 2.4. 

The MVl 6-jet identification algorithm is used to tag jets containing a 6-hadron [65]. 
For 6-jets with \t]\ < 2.5 and px > 20 (25) GeV in the 8 (7) TeV data analysis, the selection 
has an efficiency of 85%, estimated using simulated tt events. It corresponds to a rejection 
of a factor of 10 against jets originating from light quarks or gluons [66, 67]. 

When two leptons are reconstructed within a cone of AR = 0.1, or a lepton and a jet 
are reconstructed within AR = 0.3, they are considered to be the same physical object 
and one of the two is removed. In the rare occurrence of an overlap between two leptons of 
the same flavour, the higher-px lepton is kept while the lower-px lepton is discarded. The 
muon is retained in the presence of an overlap with an electron, the electron is retained in 
the presence of an overlap with a jet, and the jet is retained in the presence of an overlap 
with a muon. 

Two variables describing the missing transverse momentum are employed in this study: 
one is calorimeter-based and the other is track-based. The former, which benefits from 
the large rapidity coverage of the calorimeter and its sensitivity to neutral particles, is 
referenced as [68]. The magnitude, E™®®, is used in the analysis selection. 

The quantity E™'®® is calculated as the negative vector sum of the momenta of muons, 
electrons, r leptons, photons, jets and clusters of calorimeter cells that are not associated 
with these objects (the “soft term”). In the 8 TeV analysis, to suppress the pile-up effect, 
the ratio of the scalar px sum of all soft term tracks associated with the primary vertex 
to the scalar px sum of all soft term tracks from all vertices is employed. This ratio is 
used to scale all soft-event contributions to E™®® [69]. The track-based missing transverse 
momentum measurement is used to reduce the effects of pile-up on the resolution of the 
calorimeter-based variant [70]. It is calculated as the vector sum of the transverse momenta 
of tracks with px > 500 MeV that originate from the primary vertex. This quantity is called 
p™'®®, and the analysis selections are applied to its magnitude, p™®®. In order to include 
neutral components in the calculation of p™^®® in final states with jets, the sum of track 
momenta in jets is replaced by their energy measured in the calorimeter. 

5.2 Event selection 

Events are required to contain a primary vertex. The four channels are further split into 
eight signal regions, designed to optimise the sensitivity to the VH{H^WW*) process, 
with a specific set of selections applied to define each signal region. The selection criteria 
rely on the number of leptons and their properties such as charge, flavour, px, and on 
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the number of jets and 6 -tagged jets and on the magnitude of the missing transverse 
momentum. Leptons with px > 15 GeV are selected and their number is used to divide the 
analysis in the various channels. Similarly the analysis channels are subdivided in categories 
according the number of selected jets. Of particular importance are the invariant masses 
and opening angles among the selected objects, most notably those of opposite-sign lepton 
pairs. The spin-0 property of the Higgs boson, in conjunction with the V-A structure 
of the weak interaction, results in a preference for a small opening angle of lepton pairs 
from the ff —)■ WW* —>■ £iy£iy decays. On the other hand, as described in section 2, major 
backgrounds often contain Z boson production or tt production which give rise to opposite- 
sign lepton pairs with a large opening angle. In the 2^-SS channel, the lepton originating 
from the Higgs boson decay is selected by minimising the invariant mass of the lepton and 
jet(s); cuts are then applied to the opening angle between this lepton and the closest jet 
in the transverse plane. The definitions of the signal regions used for each channel are 
summarised in table 2 and further detailed in sections 5.2.1-5.2.4. 

In all the Ai and 3i signal regions, events are recorded using inclusive single-lepton 
triggers, which are fully efficient for high lepton multiplicity signatures. For the 2i channels 
in 8 TeV data taking, dilepton triggers are also used. In all channels at least one lepton 
must match a candidate reconstructed at trigger level. This requires the leading lepton 
in an event to have px greater than 24 GeV in the 8 TeV data sample, and greater than 
18 GeV and 20 GeV for muons and electrons respectively in the 7 TeV data sample. Single 
lepton trigger efficiencies are measured with respect to offline reconstructed leptons using 
leptonic Z decays. The measured values are approximately 95% for electrons, 90% for 
muons in the endcap and 70% for muons in the barrel. 

5.2.1 Four-lepton channel 

Events in the M channel are required to have exactly four leptons. The px of leading and 
sub-leading leptons must be above 25 GeV and 20 GeV, respectively, and the px of each 
of the remaining two leptons must exceed 15 GeV. The total charge of the four leptons 
is required to be zero. Only events with at least one SFOS lepton pair are accepted, and 
events are assigned to the 4.^-2SFOS and df-lSFOS SRs according to the number of such 
pairs. 

In order to select final states with neutrinos, and p™'®® are required to be above 

20 GeV and above 15 GeV, respectively. In order to reduce the ttZ background, events 
are vetoed if they contain more than one jet. Top-quark production is further suppressed 
by vetoing events with any 6 -tagged jet with px above 20 GeV. The invariant mass of £2 
and £ 3 , is required to satisfy GeV (where mz is the mass of the 

Z boson), and the invariant mass of £0 and £ 1 , is required to be between 10 GeV 

and 65 GeV. This requirement on greatly reduces the contamination from ZZ^A 

production in events with two pairs of SFOS leptons. 

The sensitivity is improved by exploiting two additional variables. The variable 
denotes the difference in azimuthal angle between the two leptons from the Higgs boson 
candidate in the frame where the Higgs boson’s px is zero. The Higgs boson transverse 
momentum is approximated with ~ ~Px ~ Px*’ with p;^ ~ —Px if no jet is present. 
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The angular separation is required to be below 2.5 rad. The magnitude of the 

vector sum of the lepton transverse momenta, can discriminate against the main 

background, ZZ'^*\ which has no neutrinos. A cut requiring ptai > 30 GeV is introduced 
for the 4G2SFOS SR. In this signal region the invariant mass of the four leptons is required 
to be above 140 GeV to remove events from the H ^ ZZ* ^ M decay, which are the target 
of another analysis [17]. In the signal extraction through the fit explained in section 8.4, 
the 4G2SFOS and 4G1SFOS SRs enter as two separate signal regions. 

5.2.2 Three-lepton channel 

For the 3i channel, exactly three leptons with px > 15 GeV are required with a total charge 
of ±1. After this requirement, contributions from background processes that include more 
than one misidentified lepton, such as kF+jet production and inclusive bb pair production, 
are negligible. Events are then separated into the 3G3SF, 3t'-lSFOS and 3GOSFOS SRs, 
requiring three SF leptons, one SFOS lepton pair and zero SFOS lepton pairs, respectively. 

In order to reduce the background from tt production, events are vetoed if they contain 
more than one jet. The background from top-quark production is further suppressed by 
vetoing events if they contain any 6 -tagged jet with px > 20 GeV. In order to select final 
states with neutrinos, is required to be above 30 GeV and p™'®® above 20 GeV in the 
3G3SF and 3G1SFOS SRs. In the 3GOSFOS SR, E™®® or p™^®® selections are not imposed 
because the main backgrounds also contain neutrinos. The invariant mass of all SFOS 
pairs in the 3G3SF and 3G1SFOS SRs is required to satisfy \mu — mz\ > 25 GeV. This 
requirement suppresses WZ and ZZ* events, and increases the Z-|-jets rejection. 

A lower bound is set on the smallest invariant mass of pairs of oppositely charged 
leptons at 12 GeV in the 3G3SF and 3G1SFOS SRs, and at 6 GeV in the 3GOSFOS SR. 
In addition, an upper bound on the invariant mass of oppositely charged leptons is set 
at 200 GeV in the three signal regions. These selections reject backgrounds from HF and 
reduce the number of combinatorial lepton pairs from the WZjW^* process. The latter 
could indeed give larger mass values with respect to the WH process since it can proceed 
through the t- and u-channels, in addition to the s-channel, which is also present in IF77 
production. 

The angular separation is required to be smaller than 2 in the 37-3SF and 37- 

ISFOS SRs. This cut favours the Higgs boson decay topology relative to that of IFZ/VF 7 * 
events. 

In the 37-3SF and 37-lSFOS SRs, the shape of a multivariate discriminant based 
on a Boosted Decision Trees (BDT) [71], which produces a multivariate classifier (“BDT 
Score”), is used to achieve a further separation between signal and background. The main 
purpose of the multivariate classifier is to distinguish between the signal and the dominant 
IFZ/IF 7 * and ZZ* backgrounds, and the BDT is trained against these two background 
processes. The BDT parameters are chosen in order to ensure that there is no overtraining, 
i.e. that the BDT is robust against statistical fluctuations in the training samples. The 
BDT input discriminating variables which provide the best separation between signal and 
background are the px of each lepton, the magnitude of their vector sum, the invariant 
masses of the two opposite-sign lepton pairs R™®®, and p™'®®. In 
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the fit, the shape of the distribution of the “BDT Score”, divided into six bins, is used to 
extract the number of observed events in the 3t'-3SF and 3f'-lSFOS SRs, while the shape of 
the distribution of , divided into four bins, is used to extract the number of observed 

events in the 3£-OSFOS SR. In the other channels only the event yield in each signal and 
control region is used without shape information. 

5.2.3 Opposite-sign two-lepton channel 

In the 2t'-DFOS channel, exactly two leptons with px larger than 22 GeV and 15 GeV 
are required. Only opposite-sign ep final states are considered in order to reduce the 
background from Z-|-jets, WZ and ZZ events. A cut on the invariant mass of the lepton 
pair, > 10 GeV, is applied to reject combinatorial dilepton backgrounds. In order to 

select final states with neutrinos, is required to be above 20 GeV. These selections 

reduce the background processes that contain jets faking leptons. The presence of at least 
two jets with px > 25 GeV is required. The background from top-quark production is 
reduced by vetoing events if they contain any 5-tagged jets with px > 20 GeV. To reject 
the Z-|-jets production that leads to ep final states through Z —)■ rr decay, a requirement 
of m-rT < (m,^ —25 GeV) is applied, where rrirT is the dilepton invariant mass reconstructed 
using the collinear approximation [22], namely under the assumptions that the lepton pair 
originates from r lepton decays, the neutrinos are the only source of and they are 

collinear with the charged leptons. 

Upper bounds on the invariant mass of the lepton pair, < 50 GeV, and the 

azimuthal angular separation of the lepton pair, < 1.8 rad, are applied to enhance 

the Higgs boson signal relative to the WW, ti and lT-|-jets backgrounds. Requirements on 
the rapidity separation between the two leading jets, Ayjj < 1.2, and the invariant mass 
of the two leading jets, |mjj — 85 GeV| < 15 GeV, are introduced to select jets from the 
associated WjZ bosons. The central value of the selection interval is larger than the 
W boson mass in order to retain the acceptance for ZH production with Z ^ qq decay. 

The selection mx < 125 GeV is applied, where m-x is the transverse mass of the 
dilepton system and FI™®, defined as mx = \J{E^ + — |px^^ -|- E™®P, where 

The selections on Ayjj and rrijj make this channel orthogonal to 
the ggF-enriched Uj > 2 category in ref. [22], while orthogonality with respect to the VBF 
category is ensured by explicitly vetoing the BDT signal region of the VBF analysis [22]. 
In the fit the 2GDFOS channel enters as a single signal region. 

5.2.4 Same-sign two-lepton channel 

In the 2GSS channel, exactly two leptons with the same charge are required. Lower bounds 
on lepton px are set to 22 GeV and 15 GeV and both the same-flavour and different-flavour 
combinations are considered. A lower bound on is applied at 12 GeV for same- 

flavour lepton pairs and at 10 GeV for different-flavour lepton pairs. Despite the same- 
charge requirement, a wrong-charge assignment may allow background contributions from 
Z boson decays. Therefore a veto on same-flavour lepton pairs with ~^z\ < 15 GeV 

is introduced. 
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The 2^-SS2jet and 2£-SSljet SRs require the number of jets to be exactly two or exactly 
one, respectively. Events with 6 -tagged jets having pT > 20 GeV are discarded. The 
is required to be larger than 50 GeV in the 2GSS2jet SR and larger than 45 GeV in the 
2GSSljet SR. Additional cuts are applied to events in the 2GSS2jet and 2GSSljet SRs, on 
the following variables (see table 2 for details): the minimum invariant mass of a lepton and 
the jet(s) in the event, m™" the smallest opening angle between the lepton which 

minimises the above variable and a jet, the transverse mass of the leading lepton 

and the E™®®, = ^2 x PT,iead x E.^'®® x (1 - cos(())iead - (/'gmiss)), where PT,iead and 

0iead are respectively the transverse momentum and (p angle of the leading lepton. Lower 
values of and of favour Higgs boson decays relative to the major 

backgrounds. High values of help in reducing IT+jets background. The pT threshold 
for the sub-leading muon in the /i/U channel is increased to 20 GeV in both the SRs to 
suppress misidentified muons from fT-|-jets and multijet production. 

In the fit, the 2GSS2jet and 2GSSljet SRs are further split into four signal regions 
according to the combination of lepton flavours in each event: ee, e/r, /re and /x/r, where 
refers to the case in which the electron has leading pT while /re refers to the case in which 
the muon has leading px- This splitting is motivated by the expected differences in the 
background contributions, for example IT 7 , which is expected to be zero in the nfi channel 
but not in the other channels. 

5.2.5 Signal acceptance 

The number of expected VH{H^ WW* ) events surviving the event selections is presented 
for each channel in table 3. The total acceptance for WH{H —)• WW* —>■ iulv), WH{H —)■ 
WW* Ivqq) and ZH{H WW* Ivlv) is 3.7%, 0.3% and 1.9%, respectively. The 
analysis acceptance for the ZH{H —)• WW* ivqq) process is negligible. The acceptance 
is defined as the ratio of the number of events in the SRs to the number of events expected 
according to the branching fractions for the various processes. Associated Higgs boson 
production followed by the decay 77 —)■ rr cannot be completely isolated from the selected 
final states. Therefore the results presented in this paper, with the exception of Section 

8.5 for consistency of the analysed model, include this process as part of the background, 
with the production cross section {ayu) and Br(77 —)■ rr) fixed to the SM value. 
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the definition of For Pt, leptons in the 41 channel the three values listed above refer to the leading, sub-leading, and to the two remaining 

leptons, respectively. For Pt, leptons in the 2£ channel the two values listed above refer to the leading and sub-leading leptons, respectively. For 
PTjets the value in parentheses refers to forward jets (|ry| > 2.4). 











(a) 8 TeV data sample 


Channel 


U 

2 £ 

Category 

2SFOS 

ISFOS 

3SF 

ISFOS 

OSFOS 

DFOS 

SS2jet 

SSljet 

WH {H WW*) 

— 

— 

0.56 

1.4 

1.3 

1.5 

1.0 

1.8 

ZH {H WW*) 

0.21 

0.24 

0.17 

0.18 

0.15 

0.67 

0.02 

0.19 

VH [H WW*) 

0.21 

0.24 

0.73 

1.6 

1.4 

2.2 

1.0 

2.0 

(all categories) 

9.4 


(b) 7 TeV data sample 

WH [H WW*) 

— 

0.12 0.29 0.26 

0.21 

ZH [H WW*) 

0.023 0.021 

0.013 0.033 0.028 

0.075 

VH [H WW*) 

0.023 0.021 

0.13 0.32 0.29 

0.29 

(all categories) 

1.1 


Table 3. Number of expected VH{H—>- WW*) events in the signal regions, for ttih = 125 GeV, in 
the (a) 8 TeV and (b) 7 TeV data samples. 


6 Background modelling 

The background contamination in the signal regions results from various physics processes, 
each modelled by one of the following methods: 

• Pure MC prediction: rates and differential distributions (shapes) are extracted from 
simulation and normalised to the cross sections in table 1; 

• MC prediction normalised to data: rates are extracted from data in control regions 
but shapes are extracted from simulation; 

• Pure data-driven prediction: rates and shapes are extracted from data. 

Misidentified-lepton backgrounds {W +jets, multijets) in the 2£ channels are estimated 
by using a purely data-driven method, which utilises the rate at which a jet is misidentified 
as a lepton [22]. Table 4 summarises the method adopted for each process in each signal 
region. The labels “MC” and “Data” represent the pure MC prediction and the pure data- 
driven estimation. For backgrounds modelled by simulation with a normalisation factor 
(NF) computed using data, the names of relevant control regions are shown as defined in 
tables 5 and 6. The ratio of tt yields to tW yields is found to be compatible between all the 
CRs and associated SRs, thus only one NF is computed per CR for the “Top” category. 
The ggF and VBF productions of Higgs bosons are treated as background as discussed in 
section 8.4. 

Definitions of control regions in the 4f and 3i analyses are presented in table 5, and 
those defined in the 2i analyses are shown in table 6. The CRs are made orthogonal to 
the corresponding SRs by inverting some selections with respect to the SR definitions. 
Such selections are in boldface font in the tables and are further explained in the following 
sections. 
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Channel 

U 

U 

21 

Category 

2SFOS, ISFOS 

3SF, ISFOS, OSFOS 

DFOS 

SS2jet, SSljet 

Process 

VVV 

MC 

MC 

MC 

MC 

WZjW-i* 

— 

Zl-WZ CR, SCZjets CR 

MC 

2 i-wz CR 

ZZ* 

4CZZ CR 

M-ZZ CR, SCZjets CR 

MC 

MC 

OS ww 

— 

MC 

MC 

2i-WW CR 

ss ww 

— 

MC 

— 

MC 

Wj 

— 

— 

— 

2l-W-i CR 

Zj 

— 

M-Z-i CR 

MC 

MC 

Zh* 

— 

3£-Zjets CR, M-ZZ CR 

2^-ZTT CR 

2£-Zjets CR 

W-l-jets 

— 

— 

Data 

Data 

Multijets 

— 

— 

Data 

Data 

Top 

MC 

3CTop CR 

2£-OSTop CR 

2^-SSTop CR 


Table 4. Summary of background modelling, represents the triboson processes WWW*, 

ZWW*, ZZZ* and WW^*. “Top” processes include ti and single-top production dominated by 
tW with W ^ £v decay, as well as ttWiZ. Some backgrounds are normalised by rescaling the 
MC yields by the data-to-MC ratio measured in CRs. For these backgrounds the names of the 
most important CRs are listed. The symbol “—” denotes a negligible contribution to the total 
background in the signal region. 


Channel 

il 

U 

CR 

ZZ 

WZ 

ZZ 

Zjets 

Top 

Zj 

Number of leptons I 

4 

3 

3 

3 

3 

3 

Total lepton charge 

0 

±1 

±1 

±1 

±1 

±1 

Number of SFOS 

2 

2 or 1 

2 or 1 

2 or 1 

2 or 1 

2 or 1 




(ee/i or 



{(ifie or eee) 

Number of jets 

< 1 

< 1 

< 1 

< 1 

> 1 

< 1 

Number of &-jets 

0 

0 

0 

0 

> 1 

0 

(and/or) [GeV] 

— 

> 30 and > 20 

< 30 or < 20 

< 30 and < 20 

> 30 and > 20 

< 30 or < 20 

\ma - mz\ [GeV] 

< 10(m<jr3) 

< 25 

— 

< 25 

> 25 

— 

\rnm - mz| [GeV] 

— 

— 

< 15 

> 15 

— 

< 15 

Min. [GeV] 

> 65{mf„4) 

> 12 

> 12 

> 12 

> 12 

> 12 

Max. rni^ [GeV] 

— 

< 200 

< 200 

< 200 

— 

< 200 

1 

— 

< 2.0 

< 2.0 

< 2.0 

— 

< 2.0 


Table 5. Definition of control regions in the and 3£ analyses. Selections indicated in boldface 
font are designed to retain the CR orthogonal to the relevant SR. 


6.1 Background in the four-lepton channel 

The main backgrounds that contribute to the 4t'-2SFOS and 4£-lSFOS SRs are diboson 
processes, dominated by ZZ* with from Z ^ tt decay, and triboson processes, in 

particular ZWW*, which has the same signature as the signal. These processes respectively 
account for about 85% and 15% of the total background contamination. To normalise ZZ* a 
dedicated CR, the M-ZZ CR, is defined by inverting the requirement on the invariant mass 
of dileptons from the Higgs boson candidate. All the other minor background processes, 
listed in table 4, are modelled by simulation. 
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Channel 

DFOS 2i 

SS 21 

CR 

OSTop 

Z ^ TT 

W 7 

TFZ 

IFIF 

SSTop 

Zjets 

Number of leptons 

2 

2 

2 

3 

2 

2 

2 




> 1 conversion e 





Total lepton charge 

0 

0 

±2 

±1 

0 

0 

0 

Number of SFOS 

0 

0 

— 

— 

— 

— 

— 

Number of jets 

> 2 

> 2 

2 or 1 

2 or 1 

2 or 1 

2 or 1 

2 or 1 

Number of 6 -jets 

0 

0 

0 

0 

0 

> 1 

0 

[GeV] 

> 20 

> 20 

> 45 (Ij) 

>45(lj) 

> 85 (Ij) 

> 45 (lj,ee, ii/j,) 

> 45 (Ij) 







> 60 (lj,e^) 

< 85 (lj,e^) 




> 50 (2j) 

>50 (2j) 

> 80 ( 2 j) 

> 50 (2j,ee, nn) 

> 50 (2j,ee, /i/i) 







> 60 ( 2 j,e^) 

< 80 ( 2 j,e^) 

- mz\ [GeV] 

— 

— 

— 

< 15 {OS ee,iiii) 

> 15 {ee,^/i) 

> 15 {ee,iiii) 

< 15 (ee, fifi) 

Min. ran [GeV] 

> 90 (8 TeV) 

> 10 

> 12 (ee,/i/i) 

> 12 (ee, fifi) 

> 12 

> 12 {ee,iiii) 

> 12 (ee, fin) 


> 80 (7 TeV) 










> 10 (e^) 

> 10 (efi) 

> 10 {efi) 

> 12 (e/i) 

> 55 {efi) 

Max. ran [GeV] 

— 

< 70 

< 50 

— 

— 

— 

< 80 (e^) 

airr [GeV] 

< {mz - 25) 

— 

— 

— 

— 

— 


[rad] 

— 

> 2.8 

< 2.5 

— 

— 

— 


m-T [GeV] 

— 

— 

> 105 (Ij) 

> 105 (Ij) 

> 105 (Ij) 

> 105 (Ij) 

— 

Min. m^.j [GeV] 

— 

— 

< 70 

< 70 

< 70 

< 70 

< 70 

Min. ra£.jj [GeV] 

— 

— 

< 115 

< 115 

< 115 

< 115 

< 115 

Min. (p£.j [rad] 

— 

— 

< 1.5 

< 1.5 

— 

— 

— 

p“ [GeV] 

— 

— 

> 30 

— 

— 

— 

— 


Table 6. Definition of control regions in the ‘li analyses. Selections indicated in boldface font are 
designed to keep the CR orthogonal to the relevant SR. 


6.2 Background in the three-lepton channel 

Three classes of backgrounds contribute to the channel. The first class comprises diboson 
processes; VTZ/IT 7 *, ZZ* with an undetected lepton mainly due to its low-pT, and Z 7 , 
in which the photon converts to electron-positron pairs. The ZZ* contribution in this 
channel is mainly due to single-resonant ZZ* production where the three-lepton invariant 
mass is just below the Z boson mass. The second class includes triboson processes, mainly 
WWW*. The last class of backgrounds are processes with a misidentified lepton, mainly 
Z-bjets and top-quark pair production. 

In the 3£-3SF and 3t'-lSFOS SRs, WZ/W^* and ZZ* represent the leading background 
contributions accounting for about 80% of the total background yields, with 65% from 
WZjW'y* and 15% from ZZ*. Production of Z 7 , VVV, Z-|-jets and top-quarks share the 
remaining background fraction equally. The 3%OSFOS SR contains contributions of similar 
size from ITZ/IT 7 *, VVV and top-quark production. In this SR the total background 
event yield is about eight times lower than in the 3%3SF and 3%1SF0S SRs. 

A 3^-ITZ CR is defined by reversing the Z-veto requirement, in order to select events 
with a Z boson decay. The Zi-ZZ CR and 2>£-Z^ CR are defined by requiring low 
values to reflect the absence of final-state neutrinos in the background process under study. 
For these control regions, the invariant mass of the three leptons must be consistent with the 
Z boson mass. These regions are further distinguished according to the flavour combination 
of the three leptons, namely eee or for the ZVZ^ CR, and or ee/x for the ‘il-ZZ 
CR. 

The 3£-Zjets CR is defined by reversing the and the Z-veto selections. The 

properties of misidentified electrons and muons are different; therefore the 3%Zjets CR is 
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further split into the misidentified-electron component (eee+/r/ie events) and misidentified- 
muon component (/i/U/x + ee/x events) and an NF is assigned to each component. In the 
7 TeV data sample, the predicted Z+jets event yield with misidentified muons in the M 
SRs is negligible. Furthermore, the number of events in the St'-Zjets CR with such a lepton 
flavour combination is too small to reliably extract the NF. Therefore the estimation of the 
misidentified-muon component is taken directly from simulation. 

The 3£-Top CR is defined by requiring at least one 6-tagged jet. The Z+jets contri¬ 
bution is difficult to isolate from other processes that include Z bosons. Thus the NF for 
this process is constrained not only by the 3.^-Zjets CR, but also in part by the 2 >i-WZ CR 
and the ‘ii-ZZ CR, as indicated in table 4. 

6.3 Background in the opposite-sign two-lepton channel 

The dominant background in this channel is top-quark production, which accounts for 
about 50% of the total contamination. The 2^-OSTop CR is defined by requiring a high 
invariant mass of the lepton pair in the final state. As the 6-jet rejection criteria are the 
same in the CR and SR, the systematic uncertainties related to 6-tagging largely cancel 
between the two regions. The second dominant background is Z —)• rr, which accounts for 
20% of the total background in the SR. A dedicated control region, 2i-ZTT CR, is defined by 
requiring a large opening angle between the two leptons. The WW process constitutes the 
third largest background, accounting for 10% of the total. Due to a difficulty in separating 
this process from ti events over a wide kinematic region, no dedicated CR is defined, and 
this process is modelled purely by MC simulation. 

The contribution of backgrounds with misidentified leptons, VF-l-jets and multijet, 
accounts for 10% of the total background. The misidentified-lepton background rate has 
an uncertainty of 40%. Due to this large uncertainty, the misidentified-lepton background 
contributes significantly to this channel. The WZ/Wj* production and the ggF production 
followed by H^WW* decay, each representing 5% of the total background, are modelled 
with MC simulation. 

6.4 Background in the same-sign two-lepton channel 

The WZjW^* and VF-l-jets processes each account for one third of the total background. 
Some of the WZjW^* events with three leptons enter the selection when one of the leptons 
escapes detection. To normalise this process, the 2i-W Z CR is defined by selecting events 
with three leptons. The contamination from lF-|-jets events with one misidentified lepton 
is estimated by using the same data-driven method used in the 2£-DFOS channel. 

The remaining background processes contribute at the 10% level or less. The nor¬ 
malisation of VFy is based on the 2 .^-lT 7 CR, defined by requiring at least one electron 
consistent with a conversion, including a requirement that the electron does not have a 
hit in the innermost pixel layer. The background contribution to the 2i-SS channel due 
to lepton charge misidentification, in otherwise charge-symmetric processes, is found to 
be relevant only for electrons and affects top-quark production, opposite-sign WW and 
Z-|-jets. It represents 10% of the total background in the 21’-SSljet SR and 3% of the 
total background in the 2£-SS2jet SR. The 2£-SSTop CR, 2£-WW CR and 2£-Zjets CR 
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are defined selecting opposite-sign leptons to normalise these contributions. Moreover, in 
2.^-SSTop CR at least one 6-tagged jet is selected. Due to the small production rate, no 
control region is defined to normalise the WW events from vector boson scattering with 
same charge, whose rate is taken directly from simulation. 

6.5 Normalisation factors and composition of control regions 

NFs are computed through the signal extraction fit explained in section 8.4. Table 4 lists 
the main background processes, along with the CRs that contribute to the determination of 
their NFs. The NFs, which are specific to each signal region, are fitted taking into account 
only the total number of expected and observed events in each CR, separately for the 8 TeV 
and 7 TeV data samples, and are summarised in table 7. The numbers of observed and 
expected events from simulation in the 8 TeV and 7 TeV data analysis are summarised in 
table 8. 

Background spectra and the expected composition of the CRs in 8 TeV collisions are 
shown in figures 2-5. The VH{H^ WW*) component is shown on top of the background 
to demonstrate that there is no significant signal leakage in the CRs. The exact amount of 
signal leakage in each CR is presented in table 8. 

In these tables and figures, each background process normalised using CRs is presented 
separately, while backgrounds that are not normalised using CRs are grouped together as 
“Others”. The ggF and VBF production of Higgs bosons, and the VH{H —)■ rr) process 
are included in the “Others” category, assuming mn = 125 GeV and the SM value for the 
cross sections and for the branching fraction. 
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(a) 8 TeV data sample 


Channel 

u 

u 

2i 

Category 

2SFOS, ISFOS 

3SF, ISFOS, OSFOS 

DFOS 

SS2jet, SSljet 

Process 





WZjW-i* 

— 

-•-•Uo-o.oe 

— 

0.94 ±0.10 

zz* 

OS ww 


1 9 C+ 0-22 

— 

0.80 ±0.33 

Wj 

Zj 

— 

'J-O^-0.14 

— 

1.06 ± 0.12 

Zh* 

— 

0.80+°J® (At-misid) 

0 90+°-^^ 

0.86 ±0.30 



0.33lg (e-misid) 



Top 

— 

1 ‘tf5+0-34 

1 nc;+0.16 

-‘-■U^-0.14 

1.04 ±0.08 


(b) 7 TeV data sample 

Process 




WZjW-i* 

— 

1 02 '’“^'^^ 
-'-•^^- 0.11 

— 

zz* 

1 c;q+0.36 
-'■•^^-0.31 

1 7Q+0-51 
'^-0.42 

— 

OS ww 

— 

— 

— 

Wj 

Zj 

— 

^•^^-0.09 

— 

Zjj* 

— 

O.OSIq}® (e-misid) 

111 +0-38 
-‘■•'‘--‘■-0.34 

Top 

— 

1 nc-i-o.ee 

n oQ+o.ie 
^•^'J-0.14 


Table 7. Summary of background normalisation factors in the (a) 8 TeV and (b) 7 TeV data 
samples. The uncertainties include both the statistical and systematic components (see section 7). 
“—” denotes that the background process, when considered, is normalised by MC simulation. 
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(a) 8 TeV data sample 


Channel 

4£ 

3^ 

DFOS 

CR 

ZZ 

wz 

ZZ 

Zjets 

Top 

Zi 

Ztt 

OSTop 

Observed events 

122 

578 

60 

251 

55 

156 

328 

1169 

MC prediction 

121±16 

576±63 

60±10 

249±46 

55±12 

155±31 

326±55 

1160±150 

MC (no NFs) 

118±10 

543±50 

48±4 

351±40 

48±6 

188±17 

354±56 

1120±140 

Composition (%) 









WZjW-i* 


89.3±1.5 

5.5±1.0 

25.9±3.5 

20±4 

1.68±0.31 

— 

~ 

zz* 

99.49±0.17 

6.7±1.2 

90.1±2.1 

38±5 

3.6±1.2 

47±6 

— 

— 

Zi 


0.54±0.17 

0.6±0.5 

5.5±1.5 

2.4±0.9 

43±7 

_ 

— 

.Z+jets 


l.liO.5 

2.1±1.5 

29±7 

5.50±3.34 

8.3±3.4 

78.2±2.8 

0.7±0.4 

Top 

0.019±0.012 

0.66±0.18 

0.27±0.13 

0.081±0.034 

64±6 

0.13±0.06 

10.5±1.6 

71.3±3.3 

Others 

0.49±0.17 

0.80±0.16 

1.16±0.20 

0.87±0.13 

3.6±0.6 

0.33±0.06 

11.2±1.9 

27.8±3.2 

VH (H ^WW) 

0.026±0.006 

0.93±0.16 

0.26±0.11 

0.37±0.09 

0.52±0.13 

0.052±0.011 

O.lOOiO.OlS 

0.21±0.04 


Channel T 

1 SS 2£ 

CR 1 

1 

IFZ 

WW 

SSTop 

Zjets 

Observed events 

I 228 

331 

769 

5142 

39731 

MC prediction 

229±41 

311±66 

742=h63 

5080±350 

41000±14000 

MC {no NFs) | 

1 218±35 

335±68 

787±58 

4930±330 

47000±16000 

Composition {%) 






IF 7 

85.0±2.4 

-■ 

0.46±0.14 

0.049±0.018 

0.022±0.007 

IFZ/IF 7 * 

1.02±0.27 

85±4 

2.34±0.24 

0.200±0.029 

0.38±0.09 

TVIF 

0.37±0.08 

0.028±0.014 

23.9±2.3 

1.43±0.21 

0.57±0.15 

Z+jets 

4.2±1.6 

7.0±3.5 

7.0±2.0 

2.2±0.7 

97.7±0.5 

Top 

0.68±0.20 

1.50±0.29 

62.7±2.8 

95.5±0.8 

0.86±0.21 

Others 

8.7±1.2 

5.3±1.2 

3.2±0.4 

0.63±0.11 

0.44±0.11 

VH {H WW*] 1 


0.77±0.17 

0.32±0.04 

0.036±0.005 

0.0077±0.0020 


(b) 7 TeV data sample 


Channel 

AH 

3^ 

DFOS 2H 

CR 

ZZ 

TVZ 

ZZ 

Zjets 

Top 

27 

Ztt 

OSTop 

Observed events 

24 

101 

18 

68 

9 

123 

55 

137 

MC prediction 

24±8 

101±16 

18±5 

67±15 

8±4 

123±26 

55±15 

137±20 

MC {no NFs) 

15±5 

99±10 

10.7±0.6 

81±7 

8.1±1.4 

208±12 

51±12 

145±18 

Composition {%) 









WZIW-i' 


87.5±2.5 

3.1±1.1 

6.9±1.4 

14±5 

0.61±0.15 



ZZ* 

99.71±0.12 

7.4±2.1 

92.7±2.3 

26±6 

4.2±2.5 

32±7 


— 

z~l 


1.8±0.8 

0.5±0.4 

48±7 

6±4 

59±7 


— 

Z+jets 


1.5±0.8 

3.0±1.4 

19±5 

0.4±2.2 

8.2±2.1 

76±6 

0.14±0.15 

Top 

0.031±0.015 

0.7±0.4 

0.01±0.20 

0.07±0.13 

71±10 

0.03±0.04 

13±5 

75.2±3.2 

Others 

0.23±0.11 

0.56±0.11 

0.44±0.11 

0.115±0.021 

4.2±1.4 

0.05±0.17 

11±4 

24.7±3.2 

VH [H WW*) 

0.02±0.31 

0.53±0.08 

0.106±0.030 

0.044±0.008 

0.41±0.17 

0.0154±0.0027 

0.048±0.017 

0.135±0.030 


Table 8. Number of observed and predicted events and background composition in the CRs for 
the 4£, Zl and 2£ channels in the (a) 8 TeV and (b) 7 TeV data samples. Normalisation factors are 
taken into account in the calculation of the composition. The uncertainties on event yields include 
both the statistical and systematic components (see section 7). 
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(e) 


Figure 2. Distributions of the invariant mass of leptons €0 and (- 2 , defined in secton 2, in 
the five CRs defined in the 3^ channel: (a) M-WZ CR, (b) M-ZZ CR, (c) SCZjets CR, (d) 3C 
Top CR and (e) M-Z^ CR. Data (points) are compared to the background plus VH{H^WW*) 
(TOij=125 GeV) signal expectation (stacked filled histograms), where the background contributions 
are normalised by applying the normalisation factors shown in table 7. The hatched area on the 
histogram represents total uncertainty, both statistical and systematic (see section 7), on the total 
background estimate. The last bin includes overflows. 
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Figure 3. Distribution of the 4-lepton invariant mass 7714 ^ in the CR control region. Data 

(points) are compared to the background plus VH{H^WW*) (m_f/=125 GeV) signal expectation 
(stacked filled histograms), where ZZ* events are normalised by applying the normalisation factor 
shown in table 7. The hatched area on the histogram represents total uncertainty, both statistical 
and systematic (see section 7), on the total background estimate. The last bin includes overflows. 




(a) 


(b) 


Figure 4. Distributions of the difference in rapidity between the two leading jets Ayjj (a) 
in the 2£-OSTop CR and (b) in the 2£-Ztt CR. Data (points) are compared to the background 
plus VH{H^WW*) ( 77774=125 GeV) signal expectation (stacked filled histograms), where the 
background contributions are normalised by applying the normalisation factors shown in table 7. 
The hatched area on the histogram represents total uncertainty, both statistical and systematic (see 
section 7), on the total background estimate. The last bin includes overflows. 
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Figure 5. Distribution of relevant variables in the control regions in the 2£-SS channel: (a) 
azimuthal angle between the two leptons in the 2i-Wy CR, (b) transverse momentum of 

the leading lepton in the 2£-WZ CR, (c) transverse mass tot in the 2^-WW CR, (d) missing 
transverse momentum in the 2£-SSTop CR and (e) in the 2t'-Zjets CR. Data (points) 

are compared to the background plus VH{H^ WW*) (m//=125 GeV) signal expectation (stacked 
hlled histograms), where the background contributions are normalised by applying the normalisation 
factors shown in table 7. The hatched area on the histogram represents total uncertainty, both 
statistical and systematic (see section 7), on the total background estimate. The last bin includes 
overflows. 
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7 Systematic uncertainties 


The sources of theoretical and experimental systematic uncertainty on the signal and back¬ 
ground are described in this section, and summarised in table 9. 


(a) Uncertainties on the VH{H^WW*) process (%) 


Channel 

4e 

3£ 

2t 

Category 

2SFOS 

ISFOS 

3SF 

ISFOS 

OSFOS 

DFOS 

SS2jet 

SSljet 

Theoretical uncertainties 









VH acceptance 

9.2 

9.3 

9.9 

9.9 

9.9 

10 

10 

9.9 

Higgs boson branching fraction 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

4.2 

QCD scale 

3.1 

3.0 

1.2 

1.0 

1.0 

1.3 

1.0 

1.0 

PDF and as 

1.0 

1.1 

2.1 

2.2 

2.2 

1.9 

2.3 

2.2 

VH NLO EW corrections 

1.7 

1.8 

1.9 

1.9 

1.9 

1.9 

1.9 

1.9 

Experimental uncertainties 









Jet 

2.0 

3.1 

2.5 

2.5 

2.9 

3.2 

8.9 

5.8 

soft term 

0.2 

0.3 

- 

- 

- 

0.3 

0.6 

0.2 

Electron 

2.6 

2.8 

1.6 

2.2 

2.2 

1.5 

2.1 

1.7 

Muon 

2.6 

2.4 

2.2 

1.8 

1.7 

0.8 

1.8 

1.9 

Trigger efficiency 

0.2 

- 

0.4 

0.3 

0.3 

0.5 

0.6 

0.5 

6-tagging efficiency 

0.9 

0.9 

0.9 

0.8 

0.8 

2.9 

3.5 

2.4 

Pile-up 

1.9 

0.7 

2.0 

1.4 

0.8 

1.7 

1.0 

2.4 

Luminosity 

2.8 

2.8 

2.8 

2.8 

2.8 

2.8 

2.8 

2.8 


(b) Uncertainties on the total background (%) 


Theoretical uncertainties 









QCD scale 

0.2 

0.1 

1.0 

0.9 

- 

3.7 

13 

2.3 

PDF and as 

0.2 

2.4 

0.3 

0.3 

1.6 

1.4 

0.5 

0.6 

VVV iC-factor 

2.8 

8.1 

1.1 

1.9 

0.5 

- 

- 

0.3 

MC modelling 

Experimental uncertainties 

5.3 

4.3 

7.0 

6.6 

— 

4.1 

0.8 

1.4 

Jet 

3.1 

2.4 

3.2 

1.8 

4.1 

7.2 

5.0 

3.4 

E^p'^^ soft term 

2.3 

0.6 

1.8 

1.9 

0.5 

1.1 

0.2 

0.7 

Electron 

1.0 

1.4 

1.0 

0.4 

1.1 

0.7 

1.1 

0.8 

Muon 

1.1 

1.2 

0.4 

0.7 

0.2 

0.2 

0.4 

0.8 

Trigger efficiency 

- 

0.2 

0.2 

- 

- 

0.1 

- 

- 

6-tagging efficiency 

0.6 

0.8 

0.6 

0.8 

2.6 

0.7 

1.4 

0.3 

Fake factor 

- 

- 

- 

- 

- 

2.8 

10 

10 

Charge mis-assignment 

- 

- 

- 

- 

1.4 

- 

0.7 

0.8 

Photon conversion rate 

- 

- 

- 

- 

- 

- 

1.1 

0.9 

Pile-up 

1.2 

1.1 

1.4 

0.3 

1.2 

0.9 

1.0 

1.0 

Luminosity 

0.4 

0.8 

0.1 

0.2 

0.7 

- 

0.7 

0.3 

MC statistics 

5.3 

8.0 

3.8 

3.2 

5.5 

3.1 

7.3 

3.9 

CR statistics 

8.1 

6.6 

4.2 

3.9 

8.8 

2.5 

2.8 

3.5 


Table 9. Theoretical and experimental uncertainties, in %, on the predictions of the (a) signal 
and (b) total background for each category. Fake factor refers to the data-driven estimates of the 
IT-bjets and multijet backgrounds in the channels. The dash symbol (-) indicates that the 
corresponding uncertainties either do not apply or are negligible. The values are obtained through 
the fit and given for the 8 TeV data sample. Similar values are obtained for the 7 TeV data sample. 

The table shows the uncertainties on the estimated event yield after the fit in 8 TeV 
data samples. Similar values are obtained for the 7 TeV data analysis. 
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Systematic uncertainties have an impact on the estimates of the signal and background 
event yields in the SRs and CRs. The experimental uncertainties are applied to both 
the SRs and CRs. The extrapolation parameter from a SR to a CR is defined as the 
ratio of MC estimates in the SR and CR. The theoretical uncertainties are computed on 
the extrapolation parameters, and applied to SRs. Correlations between SRs and CRs 
are taken into account in the fit, and cancellation effects are expected for uncertainties 
correlated between SRs and CRs. The different uncertainty sources are treated according 
to their correlations across the data taking periods, among different analyses, between 
signal and background sources. Whenever an effect is expected to affect coherently the 
event yields in two event samples (SRs and/or CRs), for instance the muon reconstruction 
efficiency for both signal and background, a 100% correlation is assumed and the yields 
are varied coherently in the fit through the introduction of a single parameter (nuisance 
parameter). Alternatively when an uncertainty source is not expected to affect the event 
yields coherently, for instance trigger and luminosity uncertainties across different data 
taking years, different nuisance parameters are introduced to represent uncorrelated effects. 

The “MC statistics” and “CR statistics” uncertainties in table 9 (b) arise from limited 
simulated events in the SRs and CRs and from the number of data events populating the 
CRs, respectively. 

7.1 Theoretical uncertainties 

The theoretical uncertainties on the total Higgs boson production cross section and branch¬ 
ing fraction are evaluated by following the recommendation of the LHC Higgs cross-section 
working group [21, 44, 45]. Uncertainties concerning QCD renormalisation and factorisa¬ 
tion scales, which are hereafter collectively referred to as QCD scales, PDF, the value of 
as and branching fraction are estimated. 

The main uncertainty on the VH{H^ lUVF*) process, shown in the “'VH Acceptance” 
row in table 9(a), accounts for the uncertainties in the acceptance of the signal processes 
and it is evaluated with MINLO-Uifj Powheg-BoxI.O [72] simulation. The leading con¬ 
tributions are the missing next to leading order QCD contributions in qq ^ VH simulated 
with PythiaS and the parton shower uncertainty. The first is evaluated by comparing 
the PythiaS prediction with the MINLO-U77j interfaced with PythiaS and amounts to 
7%. The second is computed by comparing MINLO-U77j interfaced with PythiaS with 
MINLO-U77j interfaced with Herwig and accounts for a further 7%. 

The cuts on the number of jets are found to be the main source of such uncertainties. 
The uncertainty on the acceptance of the gg —?■ ZH process is estimated to be 5% in the 
4i channel, the only channel where this process is relevant. 

Uncertainty on the Higgs boson branching fraction to WW*, which is particularly 
important for the VH and VBF production modes, amounts to 4%. The QCD scale uncer¬ 
tainty is 1% for WH production and 3% for ZH production. This uncertainty is larger for 
ZH production due to the gg —)• ZH contribution. The “VH NLO EW corrections” refers 
to additional uncertainties on the corrections [48] to the NLO differential cross section, 
applied as a function of the pT of the associated weak bosons in the LO WH and ZH 
production modes generated by PythiaS. The size of this uncertainty is 2%. 
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The uncertainties on ggF production are due to the inclusive cross section dependence 
from QCD scales, the PDF choice and as, they range from 7% to 8%. In the 2£-DFOS 
channel, an additional uncertainty due to the jet multiplicity cut is computed with MCFM 
[55] and amounts to about 12%. 

The uncertainties from the QCD scales of backgrounds are estimated by varying the 
scales up and down independently by a factor of two. MCFM is used to estimate this 
uncertainty in the 4£, 3£ and 2^-SS channels. In the 4£ channel the uncertainty on ZZ* 
is 4% in the SRs and CR. However, due to the cancellation between the SRs and CR, 
the resultant effect becomes negligible. In the St'-SSF and 3£-lSFOS SRs the QCD scale 
uncertainty on WZjW^* is determined in each bin of the “BDT Score” and ranges between 
3% and 6%. In the 2£-SS channel, due to cancellations, the QCD scale uncertainties on 
WZ/W'y* and VFy are found to be negligible with the exception of the 2£-SS2jet SR, in 
which 100% uncertainty is assigned to VFy. The QCD scale uncertainty on the same-sign 
VFVF-|-2jet, estimated using VBF@NLO [73], is of the order of 40%. In the 2^-DFOS 
channel, QCD scale uncertainties on top-quark and IFIT-|-2jet production with at least 
two QCD couplings, referred to as QCD WW in the following, are estimated as 9% and 
17% by using MC@NLO and MadGraph, respectively. 

The PDF uncertainties on backgrounds are calculated by following the PDF4LHC 
recipe [74] using the envelope of predictions from MSTW2008, CTIO and NNPDF2.3 PDF 
sets with the exception of top-quark production in the 2GDFOS channel, which is evalu¬ 
ated with the same technique used in the ggF-enriched rij > 2 category in ref. [22]. The 
uncertainties range from 1% to 6%, depending on the background process and the cate¬ 
gories. An uncertainty of 33% on the NLO TC-factor for the triboson process is evaluated 
by using VBF@NLO; in the 3GOSFOS SR this uncertainty is estimated in bins of 
and ranges from 1% to 6%. 

The “MC modelling” row in table 9(b) takes the yield variation observed between 
predictions of different MC generators. In the 3£ and 2GSS channels, the uncertainties on 
the WZjW'j* event yield due to the modelling of the underlying event, parton shower and 
matching of the matrix element to the parton shower are evaluated by comparing the predic¬ 
tions of Powheg-Box-I-Pythia and MC@NLO-|-Herwig. In the 2GDFOS channel, an 
uncertainty due to underlying event and parton shower modelling is assigned to top-quark 
production and Z —?■ rr by comparing the expectations from Powheg-Box-|-Pythia 
and Powheg-Box-I-Herwig, and Alpgen-|-Pythia and Alpgen-|-Herwig, respectively. 
The QCD WW event yield in the 2t'-DFOS channel is estimated using Sherpa. The uncer¬ 
tainty from the underlying event, parton shower modelling and matrix element implemen¬ 
tation is evaluated through a comparison with MadGraph-|-Pythia. The uncertainty on 
the main background in the 4£ channel, ZZ*, is dominated by the statistical component; 
a systematic uncertainty from the different models, underlying event and parton shower is 
assigned through a comparison between Powheg-Box-|-Pythia and Sherpa. 

7.2 Experimental uncertainties 

One of the dominant experimental uncertainties, labelled “Jet” in table 9, derives from 
the propagation of the jet energy scale calibration and resolution uncertainties. They were 
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derived from a combination of simulation, test-beam data, and in situ measurements [75]. 
Additional uncertainties due to differences between quark and gluon jets, and between light- 
and heavy-flavour jets, as well as the effect of pile-up interactions are included. For jets 
used in this analysis, the jet energy scale uncertainty ranges from 1% to 7%, depending 
on pt and rj . The relative uncertainty on the jet energy resolution ranges from 2% to 
40%, with the largest value of the resolution and relative uncertainty occurring at the px 
threshold of the jet selection. 

The “Muon” and “Electron” uncertainties include those from lepton reconstruction, 
identification and isolation, as well as lepton energy and momentum measurements. The 
“Trigger efficiency” uncertainty in table 9 refers to the uncertainty on the lepton trigger 
efficiencies. The uncertainties on the lepton and trigger efficiencies are of the order of 1% 
or smaller. 

The changes in jet, electron and muon energy scale uncertainties due to varying them 
by their systematic uncertainties are propagated to the evaluation and included 

in the “Jet”, “Electron” and “Muon” rows in table 9. An additional “E™®® soft term” 
uncertainty is associated with the contribution of calorimeter energy deposits not assigned 
to any reconstructed objects in the E™®® reconstruction [68-70]. 

The “ 6 -tagging efficiency” row refers to the uncertainties on the efficiency of tagging 
of 6 -jets and include contributions from 6 -jet identification and charm and light-flavour jet 
rejection factors [67, 76]. The uncertainties related to 6 -jet identification range from <1% 
to 8 %. The uncertainties on the misidentification rate for light-quark jets depend on 
and rf, and have a range of 9-19%. The uncertainties on c-jets reconstructed as 6 -jets range 
between 6 % and 14% depending on px- 

The uncertainty labelled as “Take factor” is associated with the data-driven estimates 
of the IT-l-jets and multijet backgrounds in the 21 channels; it ranges between 35% and 45% 
depending on the sample and on categories. A “Charge mis-assignment” systematic uncer¬ 
tainty is estimated to account for the mismodelling of the charge flip effect by comparing 
the number of lepton pairs with same charge and opposite charge under the Z boson mass 
peak in data and MC simulation, resulting in a 16% relative uncertainty. The uncertainty 
is assigned to IFZ/IF 7 * in the 3EOSFOS SR, and top-quark production, Ej-jets and WW 
in the 2ESS channel. 

The uncertainty labelled as “Photon conversion rate” is assigned to IT 7 in the 2ESS 
channel, and is evaluated by comparing the yield in data and in MC simulation for events 
with two muons and one electron with no hit on the innermost pixel detector layer. It is 
relevant only for the 2ESS channel and has a size of 6.5% for VF 7 . 

The “Pile-up” field in table 9 includes the uncertainty on the weights applied to all 
simulated events to match the distribution of the number of pile-up interactions to that 
of data. The uncertainty on the integrated luminosity for the 2012 data is ± 2.8% and it 
is derived following the same methodology as that detailed in ref. [77]. Eor the 2011 data 
the uncertainty on the integrated luminosity is ± 1.8% [77]. The backgrounds normalised 
with CR data are not affected by the luminosity uncertainty. 

The dominant systematic uncertainties on the VH{H^WW*) process in the M and 
2>i channels are due to uncertainties on lepton reconstruction and on the jet energy scale 
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and resolution. In the 2i channels, the jet energy scale and resolution uncertainties are the 
most important. 

8 Results 

The data collected at = 8 TeV and 7 TeV are analysed separately and then combined 
in all channels in order to search for the Higgs boson m.WH and ZH production using 
if—> WW* decays. The analyses are optimised for a Higgs boson mass of niH = 125 GeV. 
For this mass, the selected Higgs bosons decay mainly as if —)• WW* —)• £v + X, but 
a small contamination from the VH production of Higgs bosons, from the decay chain 
H ^ TT ^ iuii', is present. This contribution is treated as background and normalised to 
the SM expectation for the VH production cross section avH, and the H ^ tt branching 
fraction. 

This section is subdivided in five sub-sections. Section 8.1 shows the event yields in 
each category of the 8 TeV and 7 TeV data samples. Furthermore, distributions of some 
of the relevant variables are shown for the 8 TeV data sample. Section 8.2 summarises the 
statistical treatment used for the signal extraction. Section 8.3 quantifies the agreement 
with the background only hypothesis in each analysis category and evaluates the observed 
and expected significance in each of them. Section 8.4 reports the significance for WH, 
ZH and combined VH production, moreover it shows the measurement of their cross 
sections divided by their SM expectations (^). The VH categories are then combined with 
the categories of the ggF and VBF analysis using H —)• WW* —)■ iuH decays described in 
ref. [22]. In addition the ggF and VBF yields are extracted in a consistent manner together 
with the VH yields. Finally, section 8.5 shows the measurement of the couplings to vector 
bosons and to fermions obtained from the combination of the analyses sensitive to the three 
production modes using H —)• WW* decays. 

8.1 Event yields and distributions 

The number of events in each category is summarised in table 10 and in table 11 for the 
8 TeV and 7 TeV data sample, respectively. In the 2GDFOS and 2I'-SSljet categories the 
numbers of observed events are slightly larger than the expectation. The lepton flavour 
composition of the events in the 2GSS channel is shown in table 12, in which the high event 
yield of the 2GSSljet SR mainly originates from the and //e channels. The behaviour 
of the observed data yield as a function of the data period was studied. The events were 
uniformly distributed according to the acquired luminosity as expected, excluding tempo¬ 
rary failures of the detector subsystems as an explanation of the excess. Moreover it was 
checked that known detector defects were not increasing the rate of particular background 
sources. Finally, the kinematic distributions of the events were analysed in order to look for 
striking features pointing to some particular missing background contribution. Because no 
particular problem was found, the data excesses were attributed to statistical fluctuations. 

Distributions of some of the relevant variables after the event selection are presented 
for 8 TeV data in figure 6 for the 4£ analyses, in figure 7 for the analyses and in figure 8 
for the 2£ analyses. Figures 6(a) and 6(c) show the opening angle between the leptons in the 
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frame where the px of the Higgs boson is zero, in 4t'-lSFOS events. Figures 6 (b) 

and 6 (d) show the in 4£-2SFOS events. Figures 7(a) and 7(b) present the “BDT 

Score” distribution in the 3£-3SF and 3t'-lSFOS SRs, respectively, and figure 7(c) shows 
the distribution of AR between the leptons from the Higgs boson candidate, in the 

3£-OSFOS SR. Figures 6 (a) and 6 (b) are obtained by applying the selections in table 2 only 
down to the cuts on the p™®®, in order to maximise the number of events, while the full 
selection is applied to produce the distributions in hgures 6 (c) and 6 (d). The distributions 
in figures 7(a)-7(c) are shown with all the selections applied except for the one on the 
displayed variable. Figure 8 (a) presents the transverse mass, mx, in the 2£-DFOS SR, 
while figure 8 (b) and figure 8 (c) show the smallest opening angle in the transverse plane 
between a lepton and a jet, A(/)™“, in the 2^-SSljet SR and the 2^-SS2jet SR, respectively. 
The distributions in figure 8 are shown with all the selections applied except for the one on 
the displayed variable. No data populates figure 6 (d). In all distributions, good agreement 
between data and the MC prediction is observed. 

8.2 Statistical method 

The signal extraction is performed using the profile likelihood ratio method [78], which 
consists of maximising a binned likelihood function £(/U, 9 \ n). The likelihood is the prod¬ 
uct of Poisson distributions for each SR and CR. The mean values of the distributions are 
the sum of the expected yields of signal and background. The symbol n represents the ob¬ 
served events in each SR and CR. The signal and background expectations are functions of 
the signal-strength parameter, /x, and a set of nuisance parameters, 9. The signal strength 
/X multiplies the SM predicted signal event yield in all categories, while background nor¬ 
malisation factors, included as nuisance parameters, represent corrections for background 
sources normalised to data. Signal and background predictions are affected by systematic 
uncertainties that are described by nuisance parameters. The normalisation factors are 
left free in the fit, while the constraints on the systematic uncertainties are chosen to be 
log-normal distributions. 

The test statistic is defined as 

Qfj, = -21n ^^^’ = -21nA. (8.1) 

'^max 

The symbol 0^ indicates the nuisance parameter values at the maximum of the likelihood 
function for a given /x. The denominator is the maximum value of £ obtained with both 
and 0 floating. When the denominator is maximised, /x takes the value of /x. The pq value 
is computed for the test statistic go evaluated at /U = 0 in eq. (8.1), and is defined to be the 
probability to obtain a value of go larger than the observed value under the background- 
only hypothesis. There are no bounds on /x, although go is defined to be negative if /x < 0. 
The equivalent formulation, expressed in terms of the number of standard deviations cr, is 
referred to as the local significance Zq. The signal acceptance for all production modes and 
decays are computed assuming mn = 125.36 GeV, which is the mass measured in 77 —t- 77 
and 77 —)• 47 decays by ATLAS [79] . The acceptance for this mass is obtained interpolating 
between the values computed at mn = 125 and 130 GeV. 
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Figure 6. Distributions of the angular separation in (j) between the opposite-sign lepton pair 
from the Higgs boson decay candidate in the frame where the px of the Higgs boson is zero, 
in the 4£ analyses using the 8 TeV data sample: (a) and (c) with 4€-lSFOS events, and 
(b) and (d) with 4£-2SFOS events. Figures (a) and (b) are obtained by applying the cuts in 
table 2 down to the track missing px (Px’^**) selection, removing the other selections in order to 
increase the otherwise very limited number of events, while the distributions in (c) and (d) have 
all the selections applied. Data (points) are compared to the background plus the VH{H^ WW*) 
(TOi/=125 GeV) signal expectation (stacked filled histograms), where the background components 
are normalised by applying the normalisation factors shown in table 7. The hatched area on the 
histogram represents the total uncertainty on the background estimate including the statistical and 
systematic uncertainties added in quadrature. 
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(c) 

Figure 7. Distributions of relevant quantities for the 3£ analyses using the 8 TeV data sample: 
(a) “BDT Score” in the 3^-3SF and (b) in the 3^-lSFOS SRs, and (c) the angular separation in 
R of the two opposite-sign leptons with smaller AR distance, AR^^g^, in the 3£-OSFOS SR. The 
distributions are shown with all the selections applied except for the one on the displayed variable. 
Data (points) are compared to the background plus the VH{H^WW*) (m_f/=125 GeV) signal 
expectation (stacked filled histograms), where the background components are normalised by ap¬ 
plying the normalisation factors shown in table 7. The hatched area on the histogram represents the 
total uncertainty on the background estimate including the statistical and systematic uncertainties 
added in quadrature. 
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Figure 8. Distributions of relevant quantities for the 2£ analyses using the 8 TeV data sample: (a) 
transverse mass tot in the 2£-DFOS SR, the smallest azimuthal opening angle between a lepton 
and a jet, (b) in the 2i!-SSljet and (c) 2^-SS2jet SRs. The distributions are shown with all the 
selections applied except for the one on the displayed variable. Data (points) are compared to the 
background plus the VH{H^ WW*) (m_f/=125 GeV) signal expectation (stacked filled histograms), 
where the background components are normalised by applying the normalisation factors shown in 
table 7. The hatched area on the histogram represents the total uncertainty on the background 
estimate including the statistical and systematic uncertainties added in quadrature. 
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8.3 Characterisation of the excess and VH signal region splitting 

Table 13 shows the expected sensitivity to the SM Higgs boson with mass mn = 125.36 GeV, 
the observed signal significance Zq for H^WW* decays and the measured jjL value using 
the categories described in section 5. The 3G3SF and 3G1SF0S SRs are further split in 
the likelihood function according to the value of the “BDT Score”, while the 3t-OSFOS 
SR is split into intervals of as discussed in section 5.2.2. The intervals are shown 

in figures 7(a)-7(c). Each of the 2GSS2jet and 2^-SSljet SRs is further split into four 
sub-categories according to the flavour of the leading and sub-leading leptons. For the 
2f-DFOS a single SR is considered. The numbers in table 13 are computed by adding the 
contributions from the ggF and VBF production to the signal component, and the relative 
strengths of VH, ggF and VBF production are fixed to the SM values and constrained 
with their theoretical uncertainties. 


Signal significance Zq 


Observed signal strength n 


Category 

Exp. 

Zo 

Obs. 

Zo 

Obs. 

Zo 


Tot. 

+ 

err. 

Syst. 

+ 

err. 

R 

U 

0.41 

1.9 

1- 

4.9 

4.6 

3.1 

1.1 

0.40 

— 1 - 

2SFOS 

0.19 

0 

1 

— 5.9 

6.8 

4.1 

0.33 



ISFOS 

0.36 

2.5 


■ 9.6 

8.1 

5.4 

2.1 

0.64 

-^- 

zi 

0.79 

0.66 

1— 

0.72 

1.3 

1.1 

0.40 

0.29 


ISFOS and 3SF 

0.41 

0 

1 

— 2.9 

2.7 

2.1 

1.2 

0.92 

1 

OSFOS 

0.68 

1.2 

1- 

1.7 

1.9 

1.4 

0.51 

0.29 



0.59 

2.1 

1- 

3.7 

1.9 

1.5 

1.1 

1.1 

H— 

DFOS 

0.54 

1.2 


2.2 

2.0 

1.9 

1.0 

1.1 


SS2jet 

0.17 

1.4 

1 

7.6 

6.0 

5.4 

3.2 

3.2 


SSljet 

0.27 

2.3 

1- 

8.4 

4.3 

3.8 

2.3 

2.0 

1 


0 1 2 3 -10-8 -6 -4 -2 0 2 4 6 8 10 12 14 16 


Table 13. The signal significance Zq, and the iJ—> WW* signal strength evaluated in the signal 
regions, combining the 8 TeV and 7 TeV data. The expected (exp.) and observed (obs.) values 
are shown. The two plots represent the observed significance and the observed fx. In the fi plot 
the statistical uncertainty (stat.) is represented by the thick line, the total uncertainty (tot.) by 
the thin line. The first entry in each group (in red) indicates the combination of more than one 
category. All values are computed for a Higgs boson mass of 125.36 GeV. 


8.4 Signal significance extraction and determination of signal strengths 

The VF7-targeted categories are then combined with the categories of the ggF and VBF 
analysis using H —?■ WW* —>• decays described in ref. [22]. The combination is 

again performed by building a likelihood function that includes the SRs and CRs of the 
ggF, VBF and VH analyses. The experimental and theoretical uncertainties affecting the 
same sources are correlated among different production modes. The /r values for each 
production mode (/UggP, AtVBF) IJ^vh) are correlated in all categories and fitted together 
while the background NFs are uncorrelated among the different analyses as they cover 
different phase-space regions. Therefore, when extracting pyjy, ^xwH and ^ZH, the ggF 
and VBF productions are treated as background and their yields determined by the global 
fit. 
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The fit results for the /x values for the WH, ZH and VH production are: 


fiWH = (statO^oi (sys-)> fJ-ZH = 5.1+1^ (statOl^g (sys.), 

HVH = (statO^a? (sys-)- 

The uncertainties on the ^VH value are shown in table 14 (see section 7 for their 
description). The derivative of ^vh with mu has been evaluated to be -5.8 %/GeV at 
ruH = 125.36 GeV. 


Uncertainties on the signal strength fJ-vH (%) 


Signal theoretical uncertainties 



+ 

- 

VH acceptance 

11 

7 

Higgs boson branching fraction 

7 

4 

QCD scale 

1.6 

0.7 

PDF and 03 

3.2 

1.5 

VH NLO EW corrections 

2.5 

1.2 

Background theoretical uncertainties 



QCD scale 

10 

9 

PDF and 03 

2.3 

2.0 

VVV id-factor 

3.0 

3.0 

MC modelling 

7.5 

6.9 



Experimental uncertainties 



Jet 

14 

9 

soft term 

3.4 

2.3 

Electron 

4.8 

2.9 

Muon 

4.8 

3.2 

Trigger efficiency 

1.7 

0.9 

6 -tagging efficiency 

4.7 

3.2 

Fake factor 

14 

12 

Charge mis-assignment 

1.1 

1.0 

Photon conversion rate 

0.8 

0.7 

Pile-up 

3.0 

1.9 

Luminosity 

5.4 

3.3 

MC statistics 

8 

8 

CR statistics 

18 

15 

ggF SR statistics 

5.5 

4.4 

VBF SR statistics 

1.9 

1.5 

ggF+VBF CR statistics 

10 

9 


Table 14. Percentage theoretical and experimental uncertainties on the observed VH signal 
strength jSvH- The contributions from signal-related and background-related theoretical uncer¬ 
tainties are specified. The “VH acceptance” is evaluated using both the qq —t {W/Z)H and the 
gg —t ZH production. The statistical uncertainty due to the ggF and VBF subtraction measured 
in the categories of the ggF and VBF analysis are indicated with “ggF SR statistics” and “VBF 
SR statistics”, for the contribution from the signal regions, and “ggF+VBF CR statistics” for the 
contribution from the control regions. The row “MC statistics” shows the uncertainty due to the 
statistics of the simulated samples. The values are obtained from the combination of the 8 TeV and 
7 TeV data samples. 
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Figure 9. The value of the test statistic as a function of ^wh and uzh, for ttih = 125.36 GeV. 
The contours correspond to the values of {fJ-wH, fJ-zn) associated with the 68%, 90% and 95% 
confidence levels. The black cross indicates the best fit to the data and the open circle represents 
the SM expectation Mzh)=(1,1)- 


Figure 9 shows the value of the test statistic as a function of and ^izH] as shown, 
the correlation between the two parameters is weak. 

Table 15 summarises the signal strengths for each production mode and their com¬ 
bination at a value of mu = 125.36 GeV, together with the observed and the expected 
Zq. The combined signal strength is /x = 1.16^Q}5(stat.)^Q}5(sys.), and the significance 
of the excess respect to the background only hypothesis is 6.5 u, while the expected sig¬ 
nificance in the presence of a Higgs boson decaying to WW* is 5.9 a. Figure 10 shows 
the value of the test statistic as a function of the signal strength of each production mode 
{fJ^ggF, fJ-YBF, IJ-vh) and as a function of the combined signal strength finww- 

The obtained values are all compatible with the SM expectation within 1.4 standard 
deviations. Figures 11(a) and 11(b) show the two-dimensional dependence of the likelihood 
on fivH and figgF, and on ^VH and /rvBF- The /r values not displayed are kept as free 
unconstrained parameters in the fit. The correlation between the parameters shown in 
figure 11 is small. The central values obtained for /Xggp and /xvbf are slightly different 
from those reported in ref. [22]. The shift represents a few percent of the quoted errors 
and is pulled up by the presence of a small contamination by VH{H^WW*) events in 
the 1-jet and 2-jets categories of the ggF and VBF analysis. 
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Figure 10. The value of the test statistic as a function of the ^ value from the different production 
modes (a) ggF, (b) VBF, (c) VH and (d) all combined. All values are extracted from the combined 
fit. The best fit values are represented by the markers at the likelihood minima, with the ±1 (t and 
±2cr uncertainties given by the green and yellow shaded bands. 


Category 

Signal 

significance Zq 





Observed signal strength p. 

Exp. 

Zo 

Obs 

Zo 

Obs. 

Zo 

4 

Tot. 

+ 

err. 

Syst 

+ 

.err. p 

ggF 

4.4 

4.2 

1 - 

0.98 

0.29 

0.26 

0.22 

0.18 + 

VBF 

2.6 

3.2 

1 - 

1.28 

0.55 

0.47 

0.32 

0.25 

VH 

0.93 

2.5 


3.0 

1.6 

1.3 

0.95 


WH only 

0.77 

1.4 

1— 

2.1 

1.9 

1.6 

1.2 


ZH only 

0.30 

2.0 


5.1 

4.3 

3.1 

1.9 



ggF+VBF+VH 


01234567 


Table 15. The signal significance Zq, and the signal strength ^ evaluated for the different 
production modes: ggF, VBF and VH for mn = 125.36 GeV, for the 8 TeV and 7 TeV data 
combined. The two plots represent the observed significance and the observed In the /r plot the 
statistical uncertainty (stat.) is represented by the thick line, the total uncertainty (tot.) by the 
thin line. Combinations of different categories (in red) are shown too. All values are computed for 
a Higgs boson mass of 125.36 GeV. 
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Figure 11. Likelihood as a function of the production mode signal strengths in two-dimensional 
planes of (a) ^vh vs /iggF and (b) ^vh vs ^vbf- The black cross indicates the best ht to the data 
and the open circle represents the SM expectation (1,1). 


8.5 Measurement of the couplings to vector bosons and fermions 

The values of /Xggp, /Uvbf and ^VH can be used to test the compatibility of the bosonic 
and fermionic couplings of the Higgs boson with the SM prediction using the formalism 
developed in ref. [21]. Assuming the validity of the SU(2) custodial symmetry and a 
universal scaling of the fermion couplings relative to the SM prediction, two parameters 
are defined: the scale factor for the SM coupling to the vector bosons {kv) and the scale 
factor for the coupling to the fermions {kf). Loop-induced processes are assumed to scale 
as in the SM. The H ^ tt contribution is treated as signal and its yield is parameterised 
as a function of ny and Kp- The total width of the Higgs boson can be expressed as the 
sum of the different partial widths, each one rescaled by the square of the appropriate 
scaling factor. Neglecting the small contribution from T[H —?■ 77 ) and rarer decay modes, 
the WW* decay branching fraction is expressed as: 


Br(F^ WW*) 


_ kITsm{H^WW*) _ 

4rsM(^ ^ //) + 4 rsM(H- ^ gg) + 4 rsM(^ ^ hh) ’ 


where rsM(H^ —)• //), Tsm{H —)> gg) and Tsm{H —)• VV) are the SM partial decay widths 
to fermions, gluons and weak bosons, respectively. 

The ggF {gg —)• H) process depends directly on the fermion scale factor through the 
top and bottom quark loops, while the VBF {qq —)• Hqq) and VH{qq VH) production 
cross sections are proportional to Ky, as expressed by the following relations: 


< 7(55 ^ H) = H), a{qq Hqq) = Kyasuiqq ^ Hqq), 

a{qq WH, ZH) = nlasuiqq ^ WH, ZH). 
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where the a without subscript indicates the {kv, K_F)-dependent cross sections and usm 
represents the SM cross sections. The gg —)■ ZH production cross sections are more complex 
functions of both Ky and kf [80]: 

<^{99 —^ ZH) 8 TeV = (0.37 X K% - 1.64 X KF X Ky + 2.27 X Ky) asM{99 —^ ZH) g TeV, 

a{gg ZH) 7 TeV = (0.35 x Kp - 1.58 x Kp x Ky + 2.24 x Ky) fTsM(fl'5' ZH) 7 TeV, 

The signal event yield is expressed as a ■ Br(77—)• ITIT*) using the narrow-width approxi¬ 
mation. Only the relative sign between Ky and Kp is observable and hence in the following 
only Ky > 0 is considered, without loss of generality. 

Sensitivity to the sign results from negative interference, in the gg —)• ZH process, 
between the box diagram in which both the Z and H bosons are produced directly from 
the heavy-quark loop and the triangle diagram in which only the Z* is produced and 
subsequently radiates a Higgs boson [81]. Because the relative weights of such processes 
depend on the y/s of the interaction, different coefficients appear in the expression for 8 
and 7 TeV. 

The likelihood dependence on Ky and Kp is shown in figure 12. The product (T{gg —)• 
H) ■ Br(77—)■ kklT*), which is measured with good accuracy, does not depend on jKi?] in 
the limit jK^’j ;§> Ky. This explains the low sensitivity to high values oi Kp. 

On the other hand /ivBF and gyp-, as measured for the 77—)- WW* decay, should vanish 
in the limit Ikt?] ;§> Ky due to the increased value of the Higgs boson total width and the 
consequent reduction of the H^WW* branching fraction. The observation of significant 
excesses in the VBF and VH production modes therefore leads to an exclusion of the 
Ikt?] Ky region. 

The fit to the data results in two local minima and, although the negative Kp solution 
is preferred to the positive solution at 0.5(T, the observed results are compatible with the 
SM expectation, and the best fit values are: 

1^1 _ n Oi:-I-0.26 | | _ i Ofi+O-lO 

\KF\ — O.O 0 _g 2 g, |Ky| — i.UD_gjg, 

and their correlation is p = 0.54. 

9 Conclusions 

A search for the Standard Model Higgs boson produced in association with a W or Z 
boson and decaying into WW* is presented. Associated WH production is studied in the 
final states in which the three W bosons decay to leptons or where one W boson decays 
to hadrons while the others decay leptonically. The two-lepton and four-lepton final states 
are used to search for ZH production. The dataset corresponds to integrated luminosities 
of 4.5 fb“^ and 20.3 fb“^ recorded by the ATLAS experiment with LHC proton-proton 
collisions at = 7 TeV and 8 TeV, respectively. For the Higgs boson mass of 125.36 GeV, 
the observed (expected) deviation from the background-only hypothesis, which includes the 
Standard Model expectation for H —)> rr, corresponds to a significance of 2.5 (0.9) standard 
deviations. The ratio of the measured signal yield to its Standard Model expectation for 
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Figure 12. The likelihood scan as a function of Ky and kf both with and without the 
VH{H^ WW*) contribution. Both the expected and observed contours corresponding to the 68%, 
and 95% C.L. are shown. The yellow star and circles indicate the best fit values to the data, and 
the white cross represents the SM expectation [ny, kf)=(1,1)- 

the VH production is found to be /ivH = (stat.)^Q'y (sys.). A combination with the 

gluon fusion and vector boson fusion analyses using the H —)■ WW* —)• iulu decay is also 
presented. Including VH production the observed significance for a Higgs boson decaying 
to WW* is 6.5 a with an expectation of 5.9 a for a Standard Model Higgs boson of mass 
niH = 125.36 GeV. The combined signal strength is /r = 1.16^Q}5(stat.)lQ}5(sys.). The 
data were analysed using a model where all Higgs boson couplings to the vector bosons 
are scaled by a common factor Ky and those to the fermions by a factor Kp. They are 
measured as \kv\ = 1.06^g}Q and IacfI = 0.85lg2o. 
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